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DEC 0 3 2003 !SJ Application Serial No. 09/768,781 

Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS... -v 
Attorney Docket No. CL001057CIP 

FORM 1 (cDNA sequence provided) : 

1 ATGGACAGAG TTTATGAAAT TCCTGAGGAG CCAAATGTGG ATCCGGTTTC ^IQc <Oy 



51 ATCTCTGGAG GAAGATGTCA TCCGTGGAGC CAACCCCCGA TTTACTTTTC 
101 CATTTAGCAT CCTTTTCTCC ACCTTTTTGT ACTGTGGGGA GGCTGCATCT 



151 GCTTTGTACA TGGTTAGAAT CTATCGAAAG AATAGTGAAA CTTACCGGAT 

201 GACATACACC TTTTCTTTCT TTATGTTTTC ATCCATTATG GTCCAGTTGA V r 



251 CCCTCATTTT TGTCCACAGA GATCTAGCCA AAGATAAACC GCTATCATTA 0/2r> 



301 TTTATGCATC TAATCCTCTT GGGACCTGTT ATCAGATGTT TGGAGGCCAT 

351 GATTAAGTAC CTCACACTGT GGAAGAAAGA GGAGCAGGAG GAGCCCTATG 

401 TCAGCCTCAC CCGAAAGAAG ATGCTAATAG ATGGCGAGGA GGTGCTGATA 

451 GAATGGGAGG TGGGCCACTC CATCCGGACC CTGGCTATGC ACCGCAATGC 

501 CTACAAACGT ATGTCACAGA TCCAAGCCTT CCTGGGCTCA GTGCCCCAGC 

551 TGACCTATCA GCTCTATGTG AGCCTGATCT CTGCAGAGGT TCCCCTGGGT 

601 AGAGTTGTGC TAATGGTATT TTCCCTGGTA TCTGTCACCT ATGGGGCCAC 

651 CCTTTGCAAT ATGTTGGCTA TCCAGATCAA GTACGATGAC TACAAGATTC 

701 GCCTTGGGCC ACTAGAAGTC CTCTGCATCA CCATCTGGCG GACATTGGAG 

751 ATCACTTCCC GCCTCCTGAT TCTGGTGCTC TTCTCAGCCA CTTTGAAATT 

801 GAAGGCTGTG CCCTTCCTAG TGCTCAACTT CCTGATCATC CTCTTTGAGC 

851 CCTGGATTAA GTTCTGGAGA AGTGGTGCCC AGATGCCCAA TAACATTGAG 

901 AAAAACTTCA GCCGGGTCGG CACTCTGGTG GTCCTGATTT CAGTCACCAT 

951 CCTCTATGCT GGCATCAACT TCTCTTGCTG GTCAGCTTTG CAGTTGAGGT 

1001 TGGCAGACAG AGATCTCGTC GACAAAGGGC AGAACTGGGG ACATATGGGC 

1051 CTGCACTATA GTGTGAGGTT GGTAGAGAAT GTGATCATGG TCTTGGTTTT 

1101 TAAGTTCTTT GGAGTGAAAG TGTTACTGAA TTACTGTCAT TCCTTGATTG 

1151 CCTTGCAGCT CATTATTGCT TATCTGATTT CCATTGACTT CATGCTCCTT 

1201 TTCTTCCAGT ACTTGCATCC ATTGCGCTCA CTCTTCACCC ATAATGTAGT 

1251 AGACTACCTC CATTGTGTCT GCTGTCACCA GCACCCTCGG ACCAGGGTTG 

1301 AGAACTCAGA GCCACCCTTT GAGACTGAAG CAAGGCAAAG TGTTGTCTGA 

(SEQ ID NO:l) 



FEATURES : 

Start Codon: 1 

Stop Codon: 1348 

3'UTR: 1351 



FORM 2 (transcript 
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ATGAACACAA 


51 


TTATGAAATT 


101 


AAGATGTCAT 


151 


CTTTTCTCCA 


201 


GGTTAGAATC 


251 


TTTCTTTCTT 


301 


GTCCACAGAG 


351 


AATCCTCTTG 


401 


TCACACTGTG 


451 


CGAAAGAAGA 


501 


GGGCCACTCC 


551 


TGTCACAGAT 


601 


CTCTATGTGA 


651 


AATGGTATTT 


701 


TGTTGGCTAT 


751 


CTAGAAGTCC 


801 


CCTCCTGATT 


851 


CCTTCCTAGT 


901 


TTCTGGAGAA 


951 


CCGGGTCGGC 


1001 


GCATCAACTT 


1051 


GATCTCGTCG 


1101 


TGTGAGGTTG 


1151 


GAGTGAAAGT 


1201 


ATTATTGCTT 



squence provided) : 
GACCACAACA TTCAGAAAGA 
CCTGAGGAGC CAAATGTGGA 
CCGTGGAGCC AACCCCCGAT 
CCTTTTTGTA CTGTGGGGAG 
TATCGAAAGA ATAGTGAAAC 
TATGTTTTCA TCCATTATGG 
ATCTAGCCAA AGATAAACCG 
GGACCTGTTA TCAGATGTTT 
GAAGAAAGAG GAGCAGGAGG 
TGCTAATAGA TGGCGAGGAG 
ATCCGGACCC TGGCTATGCA 
CCAAGCCTTC CTGGGCTCAG 
GCCTGATCTC TGCAGAGGTT 
TCCCTGGTAT CTGTCACCTA 
CCAGATCAAG TACGATGACT 
TCTGCATCAC CATCTGGCGG 
CTGGTGCTCT TCTCAGCCAC 
GCTCAACTTC CTGATCATCC 
GTGGTGCCCA GATGCCCAAT 
ACTCTGGTGG TCCTGATTTC 
CTCTTGCTGG TCAGCTTTGC 
ACAAAGGGCA GAACTGGGGA 
GTAGAGAATG TGATCATGGT 
GTTACTGAAT TACTGTCATT 
ATCTGATTTC CATTGGCTTC 



ACCTCGACAA TGGACAGAGT 
TCCGGTTTCA TCTCTGGAGG 
TTACTTTTCC ATTTAGCATC 
GCTGCATCTG CTTTGTACAT 
TTACTGGATG ACATACACCT 
TCCAGTTGAC CCTCATTTTT 
CTATCATTAT TTATGCATCT 
GGAGGCCATG ATTAAGTACC 
AGCCCTATGT CAGCCTCACC 
GTGCTGATAG AATGGGAGGT 
CCGCAATGCC TACAAACGTA 
TGCCCCAGCT GACCTATCAG 
CCCCTGGGTA GAGTTGTGCT 
TGGGGCCACC CTTTGCAATA 
ACAAGATTCG CCTTGGGCCA 
ACATTGGAGA TCACTTCCCG 
TTTGAAATTG AAGGCTGTGC 
TCTTTGAGCC CTGGATTAAG 
AACATTGAGA AAAACTTCAG 
AGTCACCATC CTCTATGCTG 
AGTTGAGGTT GGCAGACAGA 
CATATGGGCC TGCACTATAG 
CTTGGTTTTT AAGTTCTTTG 
CCTTGATTGC CTTGCAGCTC 
ATGCTCCTTT TCTTCCAGTA 
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Application Serial No. 09/768,781 
Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS... 
Attorney Docket No. CL001057CIP 

CTTGCATCCA TTGCGCTCAC TCTTCACCCA TAATGTAGTA GACTACCTCC 
ATTGTGTCTG CTGTCACCAG CACCCTCGGA CCAGGGTTGA GAACTCAGAG 
CCACCCTTTG AGACTGAAGC AAGGCAAAGT GTTGTCTGA (SEQ ID NO: 4) 



HOMOLOGOUS PROTEINS: 

Top BLAST Hits: 

gi|6502963|gb|AAF14527.l|AF155511_l (AF155511) KX antigen [Mus . 
gi | 10835267 | ref | NP_066569 . 1 | Kell blood group precursor (McLeod. 
gi j 2135606 | pir | j 139294 McLeod syndrome-associated protein XK - . 
gij 3183551 |sp|P5181l|XK_HUMAN MEMBRANE TRANSPORT PROTEIN XK (KX. 
gi j 4759330 j ref |NP_004668 . 1 | Testis-specif ic XK- related protein . 

BLAST to dbEST: 



gi | 1891549 /dataset=dbest /taxon=9606 
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EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 
gi | 1891549 Germinal center B cells 

Expression information from PCR-based tissue screening panels: 
Mixed tissue 
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Inventors: Gennady MERKULOV et al. x 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS... ^ <4u L>v 

Attorney Docket No. CL001057CIP <VV 

FORM 1: / 

1 MDRVYEIPEE PNVDPVSSLE EDVIRGANPR FTFPFSILFS TFLYCGEAAS ^ ^z? X 

51 ALYMVRIYRK NSETYRMTYT FSFFMFSSIM VQLTLIFVHR DLAKDKPLSL 7 , s V)? 

101 FMHLILLGPV IRCLEAMIKY LTLWKKEEQE EPYVSLTRKK MLIDGEEVLI \ \ 

151 EWEVGHSIRT LAMHRNAYKR MSQIQAFLGS VPQLTYQLYV SLISAEVPLG ■ 

201 RWLMVFSLV SVTYGATLCN MLAIQIKYDD YKIRLGPLEV LCITIWRTLE tf? 

251 ITSRLLILVL FSATLKLKAV PFLVLNFLII LFEPWIKFWR SGAQMPNNIE 

301 KNFSRVGTLV VLISVTILYA GINFSCWSAL QLRLADRDLV DKGQNWGHMG 

351 LHYSVRLVEN VIMVLVFKFF GVKVLLNYCH SLIALQLIIA YLISIDFMLL 

401 FFQYLHPLRS LFTHNWDYL HCVCCHQHPR TRVENSEPPF ETEARQSW (SEQ ID NO: 2) 

FORM 2: 

1 MNTRPQHSER TSTMDRVYEI PEEPNVDPVS SLEEDVIRGA NPRFTFPFSI 

51 LFSTFLYCGE AASALYMVRI YRKNSETYWM TYTFSFFMFS SIMVQLTLIF 

101 VHRDLAKDKP LSLFMHLILL GPVIRCLEAM I KYLTLWKKE EQEEPYVSLT 

151 RKKMLIDGEE VLIEWEVGHS IRTLAMHRNA YKRMSQIQAF LGSVPQLTYQ 

201 LYVSLISAEV PLGRWLMVF SLVSVTYGAT LCNMLAIQIK YDDYKIRLGP 

251 LEVLCITIWR TLEITSRLLI LVLFSATLKL KAVPFLVLNF LIILFEPWIK 

301 FWRSGAQMPN NIEKNFSRVG TLWLISVTI LYAGINFSCW SALQLRLADR 

351 DLVDKGQNWG HMGLHYSVRL VENVIMVLVF KFFGVKVLLN YCHSLIALQL 

401 IIAYLISIGF MLLFFQYLHP LRSLFTHNW DYLHCVCCHQ HPRTRVENSE 

451 PPFETEARQS W (SEQ ID NO: 5) 



FEATURES : 

Functional domains and key regions: 
[1] PDOC00001 PS00001 ASN_GL YCOS YLAT I ON 
N-glycosylation site 

Number of matches: 2 

1 302-305 NFSR (SEQ ID NO: 7) 

2 323-326 NFSC (SEQ ID NO: 8) 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP- dependent protein kinase phosphorylation site 

Number of matches: 2 

1 59-62 RKNS (SEQ ID NO: 9) 

2 169-172 KRMS (SEQ ID NO: 10) 

[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches : 6 

1 64-66 TYR 

2 137-139 TRK 

3 157-159 SIR 

4 252-254 TSR 

5 264-266 TLK 

6 354-356 SVR 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 17-20 SSLE (SEQ ID NO: 11) 

2 18-21 SLEE (SEQ ID NO: 12) 

3 431-434 TRVE (SEQ ID NO: 13) 
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[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE ~ V >foi ^ 



Tyrosine kinase phosphorylation site ^22^ 



126-133 KEEQEEPY (SEQ ID NO: 14) 

[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches : 4 

1 215-220 GATLCN (SEQ ID NO: 15) 

2 321-326 GINFSC (SEQ ID NO: 16) 

3 343-348 GQNWGH (SEQ ID NO: 17) 

4 350-355 GLHYSV (SEQ ID NO: 18) 

[7] PDOC00029 PS00029 LEUCINE_ZIPPER 
Leucine zipper pattern 

100-121 LFMHLILLGPVIRCLEAMIKYL (SEQ ID NO: 19) 



Membrane spanning structure and domains; 
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%\ Application Serial No. 09/768,781 % s /"V 

. Inventors: Gennady MERKULOV et al. 'fa /CS 

_ Title: ISOLATED HUMAN TRANSPORTER PROTEINS... C <y.. X <^ \/^ 

^ffif Attorney Docket No. CL001057CIP #g *> 

^^^SffiS^LAST Alignment to Top Hit: <$? ^ 



>gi|6502963|gb|AAF14527.l|AF155511_l (AF155511) KX antigen [Mus 



musculus] Length = 446 

Score = 366 bits (930), Expect = e-106 

Identities = 179/411 (43%), Positives = 265/411 (63%), Gaps = 11/411 (2%) 

Query: 33 FPFSILFSTFLYCGEAASALYMVRIYRKNSETYRMTYTFSFFMFSSIMVQLTLIFVHRDL 92 

FP S++ S FL+ E A+ALY+ YR + T F + +VQ TL+FVHRDL 

Sbjct: 3 FPASVIASVFLFVAETAAALYLSSTYRSAGDRMWQVLTLLFSLMPCALVQFTLLFVHRDL 62 

Query: 93 AKDKPLSLFMHLILLGPVIRCLEAMIKYLTLWKKEEQEEPYVSLTRKKMLI-DGEEVLIE 151 

++D+PL+L MHL+ LGP+ RC E Y + ++ EEPYVS+T+K+ + DG +E 
Sbjct: 63 SRDRPLALLMHLLQLGPLYRCCEVFCIYC QSDQNEEPYVSITKKRQMPKDGLSEEVE 119 

Query: 152 WEVGHSIRTLAI^RNAYKRMSQIQAFLGSVPQLTYQLYVSLISAEVPLGRWLMVFSLVS 211 

EVG + L HR+A+ R S IQAFLGS PQLT QLY++++ + GR +M SL+S 
Sbjct: 120 KE VGQ AEGKL I THRS AFSRAS V I QAFLGS APQLTLQLY I TVLEQNI TTGRC F I MTLSLLS 179 

Query: 212 VTYGATLCNMLAI QI KYDDYKI RLGPLEVLC I TI WRTLE I TSRLLI LVLFS ATLKLKAVP 271 

+ YGA CN+LAI+IKYD+Y++++ PL +CI +WR+ EI +R+++LVLF++ LK+ V 
Sbjct: 180 I VYGALRCNI LAI KI KYDE YEVKVKPLAYVC I FLWRS FE I ATRVI VLVLFTSVLKI WWA 239 

Query: 272 FLVLNFLIILFEPWIKFWRSGAQMPNNIEKNFSRVGTLWLISVTILYAGINFSCWSALQ 331 

+++NF PWI FW SG+ P NIEK SRVGT +VL +T+LYAGIN CWSA+Q 

Sbjct: 240 VILVNFFSFFLYPWIVFWCSGSPFPENIEKALSRVGTTIVLCFLTLLYAGINMFCWSAVQ 299 

Query: 332 LRIJUDRDLVDKGQIWGHMGLHYSVRLVENVIMVLVFKFFGVKVLLNYCHSLIALQLIIAY 391 

L++ + +L+ K QNW + ++Y R +EN +++L++ FF + + C L+ LQL+I Y 
Sbjct: 300 LKIDNPELISKSQNWYRLLIYYMTRFIENSVLLLLWYFFKTDIYMYVCAPLLILQLLIGY 359 

Query: 392 L I S I D FMLLF FQYLH PLRS LFTHNWD YLHCVCCHQHPRTRVENSEP 438 

I FML+F+Q+ HP + LF+ +V + L C C R ++SEP 

Sbjct: 360 CTGILFMLVFYQFFHPCKKLFSSSVSESFRALLRCACWSS LRRKSSEP 407 

{SEQ ID NO: 6) 

ALIGNMENT OF FORM 1 AND FORM 2: 

>FORM 2 

Length = 462 (Length of FORM 1 = 449) 
Score = 900 bits (2301), Expect = 0.0 
Identities = 447/449 (99%), Positives = 447/449 (99%) 

MDRVYEIPEEPNVDPVSSLEEDVIRGANPRFTFPFSILFSTFLYCGEAASALYMVRIYRK 60 
MDRVYEIPEEPNVDPVSSLEEDVIRGANPRFTFPFSILFSTFLYCGEAASALYMVRIYRK 



NSETY MTYTFS FFMFSS IMVQLTLI FVHRDLAKDKPLSLFMHLI LLGPVI RCLEAMI KY 



LTLWKKEEQEEPYVSLTRKKMLIDGEEVLIEWEVGHSIRTLAMHRNAYKRMSQIQAFLGS 



VPQLTYQLYVSLISAEVPLGRWLMVFSLVSVTYGATLCNMLAIQIKYDDYKIRLGPLEV 



LCI TI WRTLE ITSRLLI LVLFS ATLKLKAVPFL VLNFLI I LFEPWI KFWRSGAQMPNNI E 



FORM 


1: 


1 


FORM 


2: 


14 


FORM 


1: 


61 


FORM 


2: 


74 


FORM 


1: 


121 


FORM 


2: 


134 


FORM 


1: 


181 


FORM 


2: 


194 


FORM 


1: 


241 


FORM 


2: 


254 



FIGURE 2C 




Application Serial No. 09/768,781 
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FORM 1: 301 KNFSRVGTLWLISVTILYAGINFSOTSALQLRLADRDLVDKGQNV/GHMGLHYSVRLVEN 360 Vj, 

KNFSRVGTLWLISVTILYAGINFSCWSAI^LRLADRDLVDKGQNWGHMGLHYSVRLVEN <^ <% 

FORM 2: 314 KNFSRVGTLWLISVTILYAGINFSCWSALQLRLADRDLVDKGQNWGHMGLHYSVRLVEN 373 \£ 

FORM 1: 361 VINmjVFKFFGVKVLLNYCHSLIALQLIIAYLISIDFMLLFFQYLHPLRSLFTHNVVDYL 420 ^ ^ 

VIMVLVFKFFGVKVLLNYCHSLIALQLIIAYLISI FMLLFFQYLHPLRSLFTHNWDYL <g> \ 

FORM 2: 374 VIMVLVFKFFGVKVLI^OTSLIALQLIIAYLISIGFMLLFFQYI^PLRSLFTHNVVX)YL 433 ^ 

FORM 1: 421 HCVCCHQHPRTRVENSEPPFETEARQSW 449 (SEQ ID NO: 2) <g> 
HCV CCHQHPRTRVENS E P P FETE ARQS W ^ 



FORM 2: 434 HCVCCHQHPRTRVENSEPPFETEARQSW 462 (Position 14 to 462 of SEQ ID NO: 5) 
Hmmer search results (Pfam) : 

Model Description Score E-value N 

CE00306 E00306 Membrane_transport_protein_XK 390.8 1.3e-113 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

CE00306 1/1 31 416 .. 1 384 [. 390.8 1.3e-113 
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TATTATTATT ATTATTAAGA CGTAATCTTG CTCTGTTGCC CAGGCTGGAG 

51 TGCAGTGGCG TGATCTCAGC TCACTGCAAC CTCTGCCGTC CGGGTTCAAG 

101 TTTTTCTCCT GCCTCAGCCT CCTGAGTAGC TGGGATTACA GTCACGCACC 

151 ACCACGACCA GCTGATTTTT GTATTTTTAG TAGAGATGGG GTTTCACCAC 

201 GTTGGCCAGG CTGGTTTCGA ACTCCTGACC TCAAGTGATC TGCCTGCCTC 

251 AGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAACCACTGT GCCTGGCCTT 

301 CATCTATATT ATTACCAGGA GGCAGATGTG TTCTCTTTTT CTCTGAGGTT 

351 TAGAATTATG CAAATGAAGA TATGAAAACA AAAGCTCAGT GAGGTGGGGA 

401 GGATTACACT TAAGAATACA GGTAATTTTC AAAGCTCTTT AAGACACCCC 

451 TCTCAGTTTT TACTAACAGC TCTCTCTTGG CTCTTTGCCA GTCTGTTTAG 

501 AATTTGGCAC CTCTTCATAA CCTTTCAACC AAAGACCTGT AAGTTCATTC 

551 TAAAGCTCCT ATCCTGGCCT CATTTTGCAA GTGGAGAAAT CAAGGCATAA 

601 AATATGAGCT TTCAGTGTCT GTGGGCTGAC CTTGAGTCTT GACCTTTATC 

651 CTGTTCTATC TTCCCTCCGC CGAAAACTCT GACCCTATTC CTCCCAGGTT 

701 CCCCCTTCAT GATATTATCT GGAGGGCAAT AGGACCTAGG GAGGTTCCAC 

751 CCTGCGGCGG AGGGAGACAC ACCTGCCTAA CAGCGTGGGT AGAGTGAGTG 

801 TTGAAGCAAG TCACTTAACT AGTTAGGGAG GGCGGGGTAG AAGTGGGGGC 

851 CTGCTGCTCC TAGGGAGGAG TAAAGCTGTG GCTCCTGCCT GGGTCTGGAG 

901 GTGGTGGTCA GAAGTGCTTC TGAAGAGCGG CCCAAGCCCC TTTTTGTCCC 

951 GCCACTCCAC AACGAGCATC CCTCGGCTGG CCGCCTGCCC GGGAACTCTC 

1001 CGGCTGGTTT TGTTTGGCCG CAGCCGTCCC GCCCATCTCG CCCGCCCCCG 

1051 CCGTCCCGGT GCCTTAGTTT TTGAAGCTGC CGACCTCTCG CAGCTGGAAT 

1101 CGCAGACCAG GCAGGACCCT GGCAGCAGAC GGCGTCCAAG AGTTTGGCGA 

1151 CCTCCGTCCA GCCAGGTTGG CGCCCCGCAC ATCGTGCCTC TCACTAGCAA 

1201 AGTTTCTCCG AGGAGAAGCA GCCCCTCCAG CCTTTTCTTC ATCCTGTAGA 

1251 GCGAGCGCGC TCTGCTTCTG TCCCTCAACA CTGCATTCGG AGACAGGGTG 

1301 GTGACAATAC TCCACTCCCG GGCCAGGCGG TCTTGGGGGC GGGGCTTGGG 

1351 GGAATCCGAG GAGCTATCCT GAGAACCCTG GACTCGGCAA AGGTCCTGAG 

1401 AGCGCGCAGG TGAGCGGGCC AGCTGATAGC TACAGCCTAG CAATAGCTAG 

1451 GATACCTAGG CACTGAACTG AATCCCCTCT TCTGCCCTCC TTCTTCTGCG 

1501 CCCGCTCTTC TGCCCTGGCT CAGCTCTCCG CTGACTTGAG AGGACACACT 

1551 GGTCAGGACT CTTTGTGAGG AGCTGCTGAG TGTOGGTGCC CCCGACAGAT 

1601 CGGCTACACC CTGCCTGAGG GGCTGCGAAA GGAGCCGCCA CGGAAGCCGC 

1651 TGTTCTCATG ACTCTTCACG TCCCTGGAGT TGGACTCTGG ATGGGGCGCT 

1701 GGGATGCTTG CTTTTGTCTT GTTCAAGTTT CACAGCAAGT ATGTTGACGA 

1751 TTGGAATCGG GGCCAATCAA GAGTCAAGTT CAAAGTGGTA CTCCTGGGCT 

1801 TTCCATCCCA GACTCCAAGT CGAATCTGAG TCTAGAAGAG AGCGGTTTCT 

1851 TGCTCTAACT AGTGAATCTC TGTTCCCAAA CTGGACTTGA CAGAGCTCTC 

1901 CTCACCTATA CTTGGACTGT AGCGGCCATA GGGTTCTCTT GGGGATGGGT 

1951 GGGAGGGTGC TATGAACACA AGACCACAAC ATTCAGAAAG AACCTCGACA 

2001 ATGGACAGAG TTTATGAAAT TCCTGAGGAG CCAAATGTGG ATCCGGTTTC 

2051 ATCTCTGGAG GAAGATGTCA TCCGTGGAGC CAACCCCCGA TTTACTTTTC 

2101 CATTTAGCAT CCTTTTCTCC ACCTTTTTGT ACTGTGGGGA GGCTGCATCT 

2151 GCTTTGTACA TGGTTAGAAT CTATCGAAAG AATAGTGAAA CTTACTGGAT 

2201 GACATACACC TTTTCTTTCT TTATGTTTTC ATCCATTATG GTCCAGTTGA 

2251 CCCTCATTTT TGTCCACAGA GATCTAGCCA AAGATAAACC GCTATCATTA 

2301 TTTATGCATC TAATCCTCTT GGGACCTGTT ATCAGGTGAG CAACTTTTAA 

2351 ATCTTTTCCT TACCCCCCTA ACCCCACCCC AGACTTGGGC AGAGAAAGAT 

2401 GAAAGATTTA CAAGATGGAT ACTATGGCTC TAATCAATTC TCTCATTTCC 

2451 TCCCACTCTC GGCTTCCCTG TCTACCATTC AGAAAACTTA CCTGAAATCT 

2501 TAAATGCCAC CATGATGAAC ATGTGGTATG TACTTGTGTT CCAAAACAAT 

2551 GAACGATGCT ATTTGGGCTG TGTAAACTAG AATGGGAACA ACAAGACGTG 

2601 ATCACCCTGT GCATGAAGGC CATAGCTGCA GAGTGTGTAA TTTTATTTAA 

2651 AAAAATTTTT TTTTCTGAGA CAAGGTCTTG CTCTGCCTCC CAGGCTACAG 

2701 TGCAGTGGTG CGATCATGGC TCACTGCAGC CTTGATCTCC TGGGATCAAG 

2751 CGAACCTCCC ACCTCAGCCT CCAAGTAGCT GGGACCAAAG GAATGTGTCA 

2801 CCATGCCTGG TTAATTAAAA AAAAATTTTT ATAGGCCGGG TGTGGTGGCT 

2851 CATGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGT GGATCACCTG 

2901 AGGTCAGGAG TTCAAGACCA GCTGGCCAAC ATGGTGAAAC CCCTGTCTCT 

2951 ACTAAAAATC AGCTGGGTGT GGTGGCGCAT ATCTGTAATC CCAGCTACTC 

3001 TGGTGGCTGA GGCAGGAGAA TCACTTGAAC CCGGAAGGTA GAGGTTGCAG 
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TGCACTCCAG 



CCTGGGCGAT 



AGAGTGAGAC 



3101 TCCATCTCAA AAAAAAAAAA ATTTTTTTTG TAGAGACGGG ATCTCGTTAT 
3151 GTAGACTGGG CTCAAGTGAT CTTCCTGCCT CAGCCTCCCA AAGTGAGCCA 
3201 CCACGCCTGG TCTGAGTGTG TAATTTTGAC TCTACCTTTT TGGATGCTTT 
3251 GTAAATTGGA TAAAAGTTTC TTTACCCTGA GCTGCTTGGG CTGGTGCTAC 
3301 TGCCATTTTC AAATTTTCCA GAGTAATGTG ACATCTGGAA ACTATTTTAA 
3351 ACCATCTGTG GTAATCTGTA CCCCAACCCA ATATAGTTCA GTTCTCTGTC 
3401 GGTTTATCAG TTTCCTATTT ATCTCTTTGT ATATTTCTGC AATAAAGATA 
34 51 CGAAGTTGGG AGGGGGCAAA GGAAGGCAGT TCATCTCTCT ATGTGGATGC 
3501 AGTAGCACAA TTTAATAGTA TCAAGTATTT CCATTCAGAT TGCCTTGAAG 
3551 TGGAAAGAAT GCACTTAATC CTAGCGAGAT AGGCACCTGT GTCAACAGTC 
3601 TCATCTGGAT GCTATGGGGT TTTCAAGGTA GAGAGATGTT GCAAAACTTA 
3651 TGAGTTCAGG AGTAAGGAAT GGACCAAGTT TGTCTTGATT GCGAGAGAGG 
3701 CAGACAACTG CAGTCAGCCG AGGAATATGG GTCAGAGTGT TGCAATGGGA 
3751 AGATACCTCA TCATTAGACA ACTAAAAAGT CTGTGAAACT AATTAAGGAT 
3801 GGAACTCACT CCTTTATAAA ATTTCATATC TGTACACATG TATAATTTTT 
3851 ATTTGTCACT TATACCTCAA TAAGGCCAAA AAAATTTTTT ATCAATAAAT 
3901 TTTTAAGTGG GGAGGAATCG ATTAGGCTCT ATCAGAGAGA ATATGGGATA 
3951 TCAATGGAAA CAGTGGCCTG AAATTTGGAG TCTAGTCTTC CGCCTGTCAT 
4001 TGACTGGTTG TGTGTTCTTG GTAAAATCTC TGAAGATGGC TTCACAGGAA 
4051 GGCATATAGA GTTCCCTCAT CTGTAAAGCA AATGGGTTAG TCTAAATCAT 
4101 GGGTCTCAAA CTCAAACACT TGCAGGGACC AGGCAGGTAT CATAAATGAA 
4151 TGAAGCAGGC CTAGTATAAG AAAAAACAGT AGCCTTGTGT GAGATGATAA 
4201 ATGGAAACAA AGTCTCAGAG AAATACTGAG GAGTAGTGAG TACCATGGTA 
4251 ATCTGAAATC TTCATGACCT GCCTGAAGGA GGTAGCCCCT CTAGAGCCCT 
4301 GGCGCATTGT TTCCATGTTG GAATTCAGAC CCAGTATTGC CAGATCCACT 
4351 AACTTTTCGG GAGATGCTCC CAAGACAGGA TTTTTATATG AAATGTCATG 
4401 ATTTTAAATT TTCACAGCTG ACTAAAACAA TAACAACAAC AACACAGGAT 
4451 GGACCAAACC ATATCTGTTG GTCAGATATA ACTCAGCTGG CCTATATGCA 
4501 TCTTTGGACT GGGTGATGTA AAGGTCCTTT ACGGTTCTAA ATCTTTGAAG 
4551 TTAAGCTGTA AAAGGAAGAC CTCATCTTGA CCTTGAAACC AAGAAATTTA 
4601 AAGTTGTGAC TACAGGAGCA AATAAACCAT TCATCCCTCC TTTTTCAAAT 
4651 ACAATATATT GAGTTAACCA ATCGAAAACT CTCAAGATAC AAATTTCAGA 
4701 AAGTACCCAG CTGCACCCTC CCCTCTTTTT GACTTCCTTT GTTTGCTTTG 
4751 TGAACCCTCT GTGTAGAGTG TTGAGTACTG TTTTTCATTT TTGTTGTTTA 
4801 GCTTCCACTA GAAATGATTG GGAAGCATTT ATAACCTCAG GCAGCTTAGC 
4851 CCACAGCAGA GAAAAGATAA AAACTCATAA ATTATACTCT GGATTCGCTT 
4 901 ATTTTCAAGG CCAATTACTT GTTAGATAGG TAGGAACTTG ATTAGTGTTA 
4 951 TCAGGCACAT GAAGGTGCTT GTAGAGTCTG GGTGCCTTAC ATGAAATGCA 
5001 AGCATACTTC CGAAATGAAA ATGTACTCTA ATTTATTGAA GCTTATAAAT 
5051 GGACAAACAC CCTTACTTAA ACCAGAAAAT AGCCCTGAGA ATAGAAACAG 
5101 AACATTTATG TAAATGTAAA CGGAACATTT CATGCCACCA CCTTCTCCAA 
5151 TACTGTTCTC CAATTTAGCA ATAGTACTGA TGGGTTGGGG TTAAAATCTA 
5201 AAATTTTTCA TTGAAAATGC ACTTATGCAG AACAAGAATA GGAAAAAAGT 
5251 GTTGCTTTTT CTTCTCTGTT CTTTCTTTGC ATCTTTTTCT TTCCCAGGTC 

53 01 TTAGAGTTTG TCCCTAGAAG GTGACAATTT CAAACTACAT GCTTCAGAGT 
5351 GGTACACATG CATCAGTCTT AGGGTGATCT ATGGAGACTG GCAGCCAGCA 

54 01 TATGTTCCAA ATTTTCCTAT CAGGAACTAA AGGCTAGAGA GCATATCAAC 
54 51 CTCTGGGCTT GTCTTTGGTC TACTTTTCTG TTAAATTTCA TTGCTGTTAT 
5501 TATTATCCTC TCCTCCCATA ATTGCTTACC CTGTATTATT TTCTTCCTTC 
5551 TTATTCTTTC ATTTACTCAG CAAATATTTC TCAAATACCT ACTAAGTGAT 
5601 AAGAGCTGTA AACAAGATAA ATACAACCCT TGACCTCAGT CTCTTGGGCA 
5651 AGACGTGTTA ATGTCCACTA CAAATGTTCT TACTAGTCAT AAGTAGTCCA 
5701 CAGTTTTTAT TCATTAAAGG TGAGTGGCGA AGTGGTAACT CAGGTGTTCC 
5751 AGTAACAAGA ATGTTCTAGT TGCTTCTCTT CCACTTACCA CATCAGAACT 
5801 GCTAAAGACT TCTGATTTGT ATGGGGGAGG TGGGAGGGGC AGAGCAGGAA 
5851 ATGTCATCTT ACCCTTATTC CAAGGATGAT AGGCTTTCAT AAGGATGTTT 
5901 TTCTCTTCGT AAAGAAAGAA TCCAGTTTAA AAGGCTTTTG TCCACAAACA 
5951 GGACAAGAGG CACAAAAAGT AACTATTACA GTGATCTTTC GAGGGCCTAG 
6001 TTATGTAGTT CATTCAGGTT TGAGTTGTCG TCTTTTAAGT ACTTTTGTTG 
6051 CTTTGATGGC TTCCTGTGTA TATGAGATAT TTTTTTTCCT CTGATCTGTC 
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CCAAGACTTT 
GAATCTGGCC 
AGCTGAGGTG 
CAACATGGTG 
TTAGCCGGAC 
GAGGCAGGAG 
GATCGCGCCA 
AAAAAAAAAA 
TTTGGTTATG 
TTTGAGGTAA 
AACAGTATCG 
CAGGCCTATT 
ACTTCTGAAA 
TTAGATTCAT 
AAATGATCAG 
CATGGTGTCT 
GGTCAAGGCA 
CAACATGTCT 
GGCATGCGCC 
TTTGAGCCTA 
CTCCAGCCAG 
AGCACAACCT 
AGGACAGCTT 
AGGCCATGAT 
CCCTATGTCA 
GCTGATAGAA 
GCAATGCCTA 
CCCCAGCTGA 
CCTGGGTAGA 
AAGGGTCTTA 
TTCATGTGGC 
TAATACTAGC 
ACTGTATTGG 
TATCGGACAG 
CAAGCCTGCC 
ACAGATAAAC 
GTAGGATATT 
TTGTTTAAGG 
ATCAGAATCA 
CTTGAATCAA 
TCAGCCACAT 
GAGAGAAAGA 
CCCAGCACTT 
AGACCATCCT 
AAATTAGCCG 
GCTGAGGCAG 
CGAGATCGCG 
TCAAAAAAAA 
AGAGTCAGGC 
GTTACAAAGC 
GAAATGAGTA 
AAGTTGAGAT 
CATCTGCTTG 
TTCCTATTAG 
GTAAGGAGAA 
TCTTGAAGTT 
GAGAAAACTC 
AAATAGGTGT 
ACACTTTGGG 
ACCAGCCTGG 
AAATCAAGGA 



TTGGCTGAGA 
AGGCGCAGTG 
GGTGGATCAC 
AAAACCCGTC 
ATGGTGGCAC 
AATCGCTTGA 
GTGCACTCCA 
AAAAAAGAAA 
GAGCTGAATG 
GAAAACAGGG 
ACTTAATACC 
CTTAAAAAGA 
CAGTGTTCTT 
GTAACTTGTA 
AACAGAAGAG 
CACACGTGTG 
GTAGGATTGC 
AGATCTCCTC 
TGTAGTCCTA 
GGAGGTCAAA 
GGTGATGGAG 
CCCCAAGTTA 
AGTTAAGTTT 
TAAGTACCTC 
GCCTCACCCG 
TGGGAGGTGG 
CAAACGTATG 
CCTATCAGCT 
GGTGAGTGGG 
GAAGTCTAGA 
TATTTAACTT 
TACATTTCAA 
ATGGCACAGG 
CACTGGGAGA 
AGATTTCTTA 
CTTCTCTGGG 
TTCCCTTGCC 
ATGGCAGTCT 
TGTGATGAGC 
TCTTAATATG 
TTGGGAACCA 
AAAGGAGCGT 
TGGGAGGCCA 
GGCTAACATG 
GGCGTGGTGG 
GAGAATGGCG 
TCACTGCACT 
AAAAAAAAAA 
CTGGACCAAG 
AATCACTGAC 
AGACCTCTAG 
AAATTGTTGT 
GGCAGGCACT 
ATTCTAACCC 
AATTTGCCCT 
CTGTTGGTCT 
TTTGGAAATG 
ACATTACGGC 
AGGCCAAGAC 
CCAACATGTT 
TGGGCATGGT 



TATGGTTGTG 
GCTCATGCCT 
CTGAGGTCAG 
TCTACTAAAA 
ATGCCTGTAA 
ACCCAGGAGG 
GCCTGGGCGA 
GAAAGAAAAA 
CTTTGAGGGA 
CCTTTGGAAC 
TTTGTGTTCA 
CAAAACAATT 
AGGTCAGTCG 
GAAGGCATGA 
AGAAAAAGAC 
ATCCCAGCAC 
TTGAGCCCAG 
TCTACACAAA 
GCTACTCAGA 
GCTGCAATGA 
TAAGACCTTG 
TTTTTTTCCC 
GCATGATGCT 
ACACTGTGGA 
AAAGAAGATG 
GCCACTCCAT 
TCACAGATCC 
CTATGTGAGC 
GTCAGGAGAG 
CCCAAGCTGT 
AAAATTAAAT 
GTTCTCAGCA 
TATAAACATT 
TAGTTAAATA 
AACAGGTAAG 
CCCAGAAGAA 
CAAAATGTTT 
TTAATAAGAG 
TTCTTTTTGA 
TATTTCTAAA 
ATGATTTAAT 
GGGCTGGGCA 
AGGCGGGTGG 
ATGAAAACCC 
CAGGTGCCTG 
TGAACCTGGG 
CCAGCCTGGG 
AAAAAAAAAA 
TTCTGACCTT 
CTAATTTCCC 
CAGAAGATGA 
TAATGAATTT 
GGTATAGGTG 
TGTTTGGCAA 
CTTTTGTCTT 
TGAAGCTTTT 
ATGATGCTTT 
TGGGCATGAT 
AGGCAGATCA 
GAAACCCCAT 
GGCTGATACC 



AGCCCTTTCT 
GTAATCTCAG 
GAGTTCAAGA 
ATACAAAAAA 
TCCCAGCTAC 
CAGAGGTTAC 
CAGAGCAAGA 
GAAAAAGCAG 
GGAATGCTTT 
CTTCATTATT 
AGGCACTTTT 
CCTCGAGTCC 
ATGACTGAAC 
TCCACCCTTT 
AAAAAGTAGT 
TTTAGGATCC 
GAGTTTGAGA 
TTAAAAATAG 
AGGCTGAGGT 
ATTATGATTG 
TCTCAAAAAT 
TCACTACAAC 
TTACTTCTGC 
AGAAAGAGGA 
CTAATAGATG 
CCGGACCCTG 
AAGCCTTCCT 
CTGATCTCTG 
GGGAGGGCTC 
CTAATAAACT 
AAAATTAAAA 
GCCGTGTGTT 
TCCATCATCA 
ACTTGTGGAG 
CTGTTTAGAC 
GCTAGTAATA 
AAATTATGCT 
GTTCCCAAAT 
AATTATATTC 
AGGTACCCAG 
CATTTCTGCT 
CGGTGGTTCA 
ATCACAAGGT 
GTCTGTACTA 
TAGTCCTAGC 
AGGCGGAGCT 
TGACAGAGCA 
GCGTGGGGTT 
CACTGTGATC 
ACTTGTAGAA 
AATGTGAGTC 
TAACAGCCTG 
TGGGTACAGG 
AGTCCCATGT 
TTTTCCCCAC 
CACATACATA 
TCCTTTAAAT 
GGCTCACGCC 
CTTGAAGTTG 
CTCTACTAAA 
TGTAATCCCA 



TGAAAAAGCA 
CACTTTGGGA 
CCAGCCTGGC 
AAAAAAAACC 
TCAGGAGGCT 
AGTGAGCTGA 
CTCTGTCTCA 
AATCTAAAAC 
ACCTCACGAA 
TTGCTAGGAA 
CTACCTGCCA 
TCAAACAAGT 
AAAAATGGAT 
GACTTATGAG 
GCAGGCTGGC 
CAGCACTTTG 
CCAGTCTGGG 
CTGGCATGGT 
GGGAGGATCA 
TGCCACTGCA 
AAAATAAAGT 
CTCCCTTCCC 
AGATGTTTGG 
GCAGGAGGAG 
GCGAGGAGGT 
GCTATGCACC 
GGGCTCAGTG 
CAGAGGTTCC 
CAGTTAAATC 
GGCCACTAGC 
ACTTGTTCAT 
GCTAGCAACT 
CAGAAAGTTC 
TCAGACATCT 
TAAAAATGTC 
CCAGCAGTCA 
GTTGTTTTGT 
AGTACTGATC 
ACTCCCCAGA 
TTGATTTTGA 
AATGCCAGTG 
AGCCTGTAAT 
CAGGAGATTG 
AAAATACAAA 
TACTCGGGAG 
TGCAGTGAGC 
AGACTCCGTC 
AATACTAATG 
TTTGGAGGAA 
GAGGGATCCT 
AGTGTTTTCA 
AGATTTGCTT 
TTTGGACCAT 
CTCAAATAAG 
TCAGAATTGT 
GTAGTTTGAG 
CATCTAATAA 
TGTAATCCTA 
GGAGTTCAAA 
AACACAAAAA 
GCACTTTGGG 
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9151 AAGCCGAGGC AGGTGGATCA CCTGAGGTCA GGAGGTTGAG ACCAGCCTGG 

9201 CCTGGCGAAA CTCTGTCTCT ACTAAAAATA CAAAAATTAG CCGGGTATGG 

9251 TGGTGGATGC TTATAATCCC AGCTACTTGG GAGGCTGATG CATGAGAATC 

9301 ACTTGAACCT GGGAGCCGAG ATCTCACCGT TGCACTCCAG CCTGGGCAAC 

9351 AGAGCGAGAC TCTGTCTCAA AAAAAAATTC AGCCAGGCGT GGTGGTGGGT 

9401 GCCTGTAATC CCAGCTACTT GGGAGGCTGA AGCAGGAGAA TTGCTTGAAC 

94 51 CTGGGAGGTG GAGGTTGCAG TGAGCTGAGA CTGCACCACT GCACCCCAGC 

9501 CTGGGCGACA GAGGGAGACT CCCGTCTCAT AAATAAATAA ATAAATAACA 

9551 AAAGTAATAC ATGCACAAAA TGACATATAA GTAATTGTAT TTGCACAGAA 

9601 AATTTCTGGA AACTATGCAA GAAACTACCT CTGCGGAGTG GGAATGAAAA 

9651 GTCAGCAGTC TTACTTTTTA AAATTCTTCT GTATGGTTTG AAAATTTTTT 

9701 TTGTGATCAT GCATTACTAG TTTTGGTCTT TATCTTTTTT TAATTACAAA 

9751 AGTCAGACAT GGTTATAGTA AAAATTAAAA ACCATACAGA ATAGATATAA 

9801 AATAGGAAAC GTAATCTCAC TCCCCAAAGA TAACCTCTGT TAATCATCCA 

9851 GTATATATCC TTCTGGACTT ATTTTTACTA TGTAAACATA AACATACATA 

9901 CAATATATAT TGTACATGTT TTTGCCCCAA AATGGACTGT ATGAAACATT 

9951 CTGTCAACAA AGTATTTTTC AAAAGTACAG TATGCCAGTA TGTCTTTTCT 

10001 CAAGTTATTT ATATATACAT GTATAACAAT AATAAATATA TAATATACAT 

10051 TTCCTTTATA TGAATTAGAC TATTTTTATT CTCCTAATTT TCTATTGATA 

10101 GGATTCTATT GATTGTCTCA AAAAGGAAAA AAAAAGGTAG CACAACCTCC 

10151 CCTAGTTATT TTTTCCCCTC ATTACAACCT CCCTTCTCAG GACAGCTTTA 

10201 GTTAAGTTCC CATGATGCTT TACTTCTGCA GATGTTTGGA GGCCATGAGT 

10251 AAGGACTTCA CACTGTGGAA GAAAGAGGGG CAGCAGGAGC CCTATGTCAG 

10301 CCTCACAAAT TTTAATTTTT CACAAAAAAG TTGTTTCCTA ATTGCAAATT 

10351 ATGCCACAGT AAACATCTTT ATAAATACCT GTGTACATGA ATGAGACTTT 

10401 GTAGGATAAA TTTATAGCAG TAGAATTGCT GGGTGGAAGG ATATGTATGT 

10451 TTAAAATTTT ATTGATATTG CCAAATAACT CTTCCAAAAA GATATATGAA 

10501 TTTATACTCT CACCAACAGT ATACAAATGT GCCTGTTTCT GTTTCTTCAT 

10551 ATCTTAAACT CAATATTCTT TATTTGTATA ATTATAAAAT AATTGGCTTT 

10601 TAAAATAATT GACTTTTAAA ATAATTCGCT TTCTTTGGTT ATGAATGAAG 

10651 CTGAGCATCT TTTTGTGTTT GGTCATTGTG TGTTCTGTGA ATTGCTTGTT 

10701 TATATATTTT ACTCGCTTTT TCTAGTGGGT TGTCTTTTTC ATATTAATTT 

10751 TTAGGAGCTA TTTACTTATT CTTGTTATTA ATCCTTTCTC TGCTGTGAAT 

10801 ATGTATGCAT ATATTTGTAT AATTTTTTGC TTGTACATAC ACACATTTTA 

10851 AATATGTATA TACATGTCAT ACGTGTAATA TGTGTGTGAT ATATTTAATA 

10901 TCCACAATAC ACTTTGTAGT ATCTTCTGGC ATTCTGAAGT ATTACATTTT 

10951 TATGTATTCA AATTCTTTAT TGCTTTTAGA TTTTGTGCCT TTCTTACAAA 

11001 GGCCTATCAC ATCTCTCATC TGGTAGAACA ATTTTCCCCA ATCTTTTAAG 

11051 TAGATTAATT TTCAAGATAA TTTTTTAATT CATCCTACAA AAAACAAAGC 

11101 AAAATAATAA CAGCAAAAGA AAAAAACATT TCATTGAGAT TCCGATTGAG 

11151 ATTTGCATCA AATTACTTAG GTTATTTTGG GAGAATTTAC ATCTTTATAG 

11201 GATTGTTGGA TTTCATATTG TGAAATGATA AATCTCTCCA TTTTATTAAA 

11251 TATTTTAACA TGTACCTCAG TAAAATTGTA TAGTTTTCTT CAGTAAAGTT 

11301 GTATAGTTTT CTTGCTATGA GTCTTACATT TTTATAAGGC TTACTTTCAG 

11351 ATGTTCTATC AGTTTTAAAA TGACCTGATT TTCTAAGTAG CAGGATAGTA 

11401 TCCAGGTAAA GTAAACCCAC CTACCATACT TTTGGAAATA GGGGGATGAT 

114 51 GAAGATGACA AAGAATAGGA AGAAAGAGGA GGAGGGGGAG GAGGAGAAAA 

11501 AGGAAAGGAA GAAGGGAAGA AGGAAGAAGA AGAAAGAAGA ACACAGCTAA 

11551 AAGAATTTAC TAGGTTCTAG GCATTTCTCT AAGCCCTTTA CATGTAAATG 

11601 TTTATTTAAT CTTTACCACA ACCCTATGAG ATAATTATCA TTCTCATTTT 

11651 ACAGACGAAG AAACAGACGC ACTGAGAGTT TAAGTATATT CCCCCAAGGT 

11701 CCCATAAGCA AAGATTGGAT TGGAATTCAG GGTGTTTGCC TCCAGAGCCT 

11751 GTGTATTTTG TTCTCTTATG GCATGAGTGT ATTTGTAGGG ACACAGATTG 

11801 AAAATGTTTT GACATTTATT GGAAGCATCA GGTTTTTTTC CTTCTGTTAC 

11851 ACTACTAATC AATAAATGAG TTCTAATGTA AGGGAAAGCA TCGCCACACA 

11901 GCTGGATGTA TGCTCTCACA TTCCCAGTTA CATAAGGTGC ATCAGCTCTT 

11951 GAGGATGGGA CTGAGAATGG TTGAGAAAGA CAAGAGTCAC CACTTCAAGA 

12001 GTCTCCAATT CCAGTCTCTG AGATTCCAAC ACTCTACTTA AAACTGAAGA 

12051 ACTCAGAGCT GTGCTTCCTT TTGGGTTTAC ATGGGGGAAA TCTTAACTTT 

12101 TCCTCCACTA AAAGTAAAAG ATTAAGTTGA CATCTCTATG GCCACCTTTT 

12151 CCCTACATCA AGTGTTTTAA TAGGAACAGA AAACTCCAGC TTTCCTTTTG 
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12251 TTAGTAATCA GCATCCTTTG ACATAAAGGA AGAAAAGGAA AGGGCCACCT NiJ!lE>^ 

12301 GTGTCATCTA TAGTTGAGGC TTAGGTAGGT TAGGGAGCAT CCTGGCTCTT 

12351 TAGGGCCACT ACTCTAACAT ATGGTTCCAT GGATGTCATG GGTGAGGCAA QC^f^PIX/F 

12401 CAGGGTTTGG AAATTTTTGA ACTACTCTGC TGCAAACTCA GAGATTCCTA tl tZ W CZ I V L- 

12451 ACATATGGGG GTAATGAATT GACATTGCTG ATGACAAATA TAAGCAACTC 

12501 TTGAGTATCT CAGTGAATTG AGAACTGAGG TACATAGATA TTCAGTGACT DtC 0 V 2003 

12551 TCCAAAAGTT CCCATACAGC TGAACCAAGG ATTTCTTTCT TTCTTTCTTT 



12651 TTCTTTTCTT TCTTTCTTTC TTTCTTTCTT TCTTTCTTTC TTTCTTCTTT 
12701 CTTTCTCTTT CTTTCTCTCT TTTTCCTTTT CTTTTTCTTT CTTTTCTTTC 
12751 TCTCTTTCTT TCTCTCTCTC TCTTTCTGTC TTCCTCCCTT CCTCCCTTTC 
12801 TCTCTTTCTC CCTTTCTTTC TTCCCTTCCT CCTTACAGGC ATGCACCACC 
12851 ATGCCCAGCT AATTTTTGTA TTTTTAGTAG AGTACCGGGT TTCACCATGT 
12901 TGGTCAGGCT GGTCTTGAAC TCCTGACCTC AGGTGATCCA CCCACCTCAG 
12951 CCCCGCAAAG TGCTTGGGAT TACAGGTGTG AGCCACCGTG TGCACGGCTG 
13001 GAACCAAGGA TTTCTAATTA GTTTTATTTT TTATTTTTTT TCTTTTTGAG 

13 051 AAGGAGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGTGGG GCAATCTCAA 
13101 CTCACTGCAA CCTCTGCCTC GTGGGTTCAA GTGATTCTCG TGCCTCAGCC 
13151 TCCTGAGCAG CTGGGATTAC AGGCATGCCA TCATGCCTGG CTAATTTTTT 
13201 TTTCTTTCTT TTTGAGACAG AGTCTCACTC TGTTGCCCAG GCTGGATCGC 
13251 AGTGGTGCAA TCACGGCCCA CTATTACCTC TGCCTCCCAG GTTCAAGTAA 
13301 TTCTCCTGCC TCAGCATCCC AGGTAGCTGG GAATACAGGT GCACGCCACC 
13351 ACGCCTGACT AATATTTGTA TTTTTAGCGG AGATGGGGTT TCATCATGTT 
13401 GGCCAGGCTG GTCTCGAACT CCTGACTTCA GGTGATCCAT CCGCCTTGGC 
13451 CTCCCAATGT GCTGGGATTA CAGGCATGAG TCACCGCGCC CAGCCTAACT 
13501 AGGTATTTTA TGCACCTCTC CTAATCTCAG AAGTCTTCAT TAATTCCACA 
13551 AACATTTATT GAGCACCTGC TATGTTCCAG GTAATATGTT AGGCTATGGG 
13601 AATACAGCAG TGAAGAAAAC ATGGTCCCTC CTGCCTTCAT GGAATTTTCA 
13651 ATACACATTT TGACACATCA CTGAAGCTAA GTGTTCTAGA AACACACAAA 
13701 CAATGTTAGT TCCTTGAACA AGATATACAT CAAAGAAGGG ACTTCTATTA 
13751 GCAAGAGCGT TCTCTATGAG TCTCCTAAGA CTGGATTTTT TCAGATAGAG 
13801 TTCTTTCCGC CTTATTCAAT GTTTGCTCCG AAGCCTGCTT CATCAGCAAA 
13851 GTCTGCCTGA TACCTTTATA TGTACTCTTC TCACGTTAGT GACTTCTCAA 
13901 TGTTCTAAGA CCCATGCTTT TTAAGGAAGT TTATTTTGTA TATTTATATG 
13951 ATTATTAAAG TGTTACAGTA TATGTTCATC ATGAGAAATT TAGAAAATAG 
14001 AGAAATGTAG AGAAAAAGAT TTCTAAAACT GATATAAGAC TATCACACAC 
14051 AAAAAAAGAT ATTTTGGTTC ATTTTTTCAA TTTTTTGTGC ATCTATTTTG 
14101 TTTTATTGTA TATATTCAAG GTGTACAATG TGATGTTTCG ATGTATGTAC 
14151 ACATTGTGAA ATGATTACCA CAACCAAACT AATTAACACA TTCATCACCT 
14201 CACATAGTTA TCATTTTTGT ACGTGTGTGT GTGTGTGTGT GTGTGTGTGT 
14251 GGTAAAACTT AAGATCTACT CTCTTTAAAA ATTTCAAGTA CACAATACAT 
143 01 TATTGTCAAC TATAGTCATC ATGTTGTACA TTAGAGCTCT GAAACTTATT 

143 51 TATCTTATAA CTCTAAATTT GTAGCCTTTG ATCAAAATCC TTCTATTTCC 

144 01 CTAAATCCCC ATCCCCTGGT AACCACCCAT TCTACTCTGT TGCTAGGTGT 
144 51 TCAACTTTTT TAGATTCCAC ATATAAGTAA GACAATGCAG TATTTTTCTT 
14501 TATGTGTCTA GCTCATTTCA CTTAGCATAA TGTCCTCTAG GTTCATCTGT 
14551 GTTGTCACGG ATGGCAGAGT TTCTGTAATT TTATGGTTGA ATAATATTCA 
14601 TACACACACA CACACACACG CACACACACA CACACACACA CAGACACACC 
14651 CACCAGATTT TCTTTATCCA TTCATCTGTC AACAGATACT GAGTTTGTTT 
14701 CCATATCTTG GCTATTGAGA ATAATACTAC AATGAGCATG AGAGTGCAGA 

14 751 TATCTCTTTG AGATACTGAT TTCCTTTAGG TATACACCCA GCAGTGGGAT 
14 801 TATTTGATCG TTTGGCCGTT CTGTTTGTAA TTTTTTTGGA GAACCTCCAT 
14 851 GCTGTTTTCC ATAATGGCTG TGTCAGTTTA TGTTCCCACA AACAGTGTAC 
14 901 AAGGTCCTTT TTCTTACATC CCCACCAACA CTTTTTTTTT TTAATAATAG 
14 951 CCATTCTAAC AGGTGTGAGG TGATATCTCA TTGTGGCTTT GATTTGCATT 
15001 TTTGTGATGA TTAGTGATGT TGAACACCTT TTCATATACC TGTTGGCCGT 
15051 TTGTATACCG CCTTCGGAGA AAGTCTATTC AAGTGCATGC TATTTGTTTA 
15101 CATAGCTGTG ATCATATTTG CATTTGCTCT TAACTGGAGC TCTCAAGTCT 
15151 CACCCGTCAT CTCTCTGGAC CTCTGGGTTA TAAGTACAGC CTTCATTACC 
15201 AACATTGACT GATTGCCTGT TTTTGTTTTT GTTTTTGTTT TTAACAGTTG 



12601 CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT 
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TGCTAATGGT 
AATATGTTGG 
GCCACTAGAA 
CCCGCCTCCT 
GTGCCCTTCC 
TAAGTTCTGG 
TCAGCCGGGT 
GCTGGCATCA 
CAGAGATCTC 
ATAGTGTGAG 
TTTGGAGTGA 
GCTCATTATT 
AGTACTTGCA 
CTCCATTGTG 
AGAGCCACCC 
TTCTGGGTAT 
TGAACGAAAG 
TGACTTTTCT 
TTATTTTCTT 
TATAGTTGAA 
ATTGGAATAA 
AGCTCTGCTC 
AAGGCCCAGA 
AGCTTACCAG 
CCTGCAACTG 
TGGCTGAGAA 
TAGCTCTCTG 
TACAGTGGCA 
TCCTTCAGGG 
GCCTCCAGCT 
CTCATTAGCA 
TTACTACTTG 
TCCGATGTGT 
GTATTTCATT 
TATGTGTGTA 
GTTTGCGATT 
GGAGGGAAAC 
CAGGAGTTTA 
GTTTTCTAGG 
GCCTCCTGGG 
CCACCCGCCT 
CACGCGGCCA 
TCTGTTCGTT 
TATATCTCAA 
TCAGCCAGCT 
TGCCTATTGG 
TCACAGACCA 
TTCAAGAAAT 
TTTACTCTTG 
CTTATCCAAT 
CAGGGCTGTT 
TCCCTTCATC 
GGAGAGACTG 
ACCCCAACTC 
CATATAAAGC 



ATTTTCCCTG 
CTATCCAGAT 
GTCCTCTGCA 
GATTCTGGTG 
TAGTGCTCAA 
AGAAGTGGTG 
CGGCACTCTG 
ACTTCTCTTG 
GTCGACAAAG 
GTTGGTAGAG 
AAGTGTTACT 
GCTTATCTGA 
TCCATTGCGC 
TCTGCTGTCA 
TTTGAGACTG 
TTTAGGAAGA 
GATGAGGTTC 
GTTAAGCCTA 
AAGAGTTACC 
AGGGTCAGAA 
CTTGTCTTCA 
ATTTGCTTGA 
ATCCCGCTAG 
CTAAGAGTAC 
TCAGTGTGGC 
GAAAATCACC 
TTAGGATTTT 
CTTCATAGGC 
CATAAGCCAA 
TGTCTATTGG 
GAGATTTCCT 
ATGTGGACCT 
GGAAGAAATG 
AATGTGTGTG 
GGGAAGAAAG 
GGGATGGGTG 
AAGATCACAT 

CTGAAGTGCA 
TTCAAGCAGN 
CGGTCTCCCA 
GGATTTTACT 
CCTATACCCC 
TGTTTACATC 
ACCTCTGAGG 
GACCAAATAT 
GGAAAATTAA 
TGTGAACAAT 
GTAGCATGCT 
GTCAGGACAT 
AACCAAAGTA 
AGCCACTGTT 
TATGGACTTG 
CCCCATTCCA 
TCACCTGCTG 



GTATCTGTCA 
CAAGTACGAT 
TCACCATCTG 
CTCTTCTCAG 
CTTCCTGATC 
CCCAGATGCC 
GTGGTCCTGA 
CTGGTCAGCT 
GGCAGAACTG 
AATGTGATCA 
GAATTACTGT 
TTTCCATTGG 
TCACTCTTCA 
CCAGCACCCT 
AAGCAAGGCA 
GTTGGGAGTT 
ATGGGTGAGA 
TCAGAAGAAA 
ACTATGTTTG 
ATACCATTCA 
AACACTTTAG 
TGCATTTCTG 
AATATATCCT 
CTCCCAGGAA 
TGGGGTCTTT 
ACTATTAAAA 
ACTAGTGGCC 
ACACTCTAAG 
AATGACTCTT 
AATTGTTAAG 
CCAACCCAGC 
CAGAGAAGCT 
AAGACTGCTT 
TGTGTGTGTG 
TAATAATGGC 
ACTAACACTC 
TAAATCATCA 
TTTTCCTTTT 
GTGGTGTGAT 
NNNNNNNNNN 
AAGCACTGGG 
TTATAACAAG 
CAGTGGACGA 
TCATATCAGT 
TAACCAAGGG 
CCCAGCATTT 
AAGAACTCCT 
CTGACCTAGG 
AAGAATTTAT 
TATGTTGAGG 
TCTTATTTCT 
TGGGACTAAA 
CTTTGGAACA 
TTTTTCATGA 
TCTTCTAACT 



CCTATGGGGC 
GACTACAAGA 
GCGGACATTG 
CCACTTTGAA 
ATCCTCTTTG 
CAATAACATT 
TTTCAGTCAC 
TTGCAGTTGA 
GGGACATATG 
TGGTCTTGGT 
CATTCCTTGA 
CTTCATGCTC 
CCCATAATGT 
CGGACCAGGG 
AAGTGTTGTC 
GCCAAGAGTA 
TACCCATCAG 
GAGCAACTCC 
GAAACAGGGG 
CACCCTTCTT 
GCTCTCTAAA 
AGCTTTCCTG 
GACTGATCAG 
ACAGTCTGAC 
TTAATTCCAG 
AAGCTGCTCC 
ATTCAGCAAG 
GAGAAAGTGC 
TTTCTCAGGG 
TGAAGCCTCT 
TTTTCTGTGC 
GAACTGTAAT 
TGTGTCTGCT 
TGTGTGTGTG 
TGAGACATCA 
CAAGGTAGAG 
TCAGTACTGG 
TTGAGATGGA 
CTTGGCTCAC 
NNNNNNNNNN 
ATTACAGGCA 
GAACATATGT 
ATGCATGTCT 
TGGGTATTTT 
ATTGAAGTTA 
ACCTAACTAA 
AGTCGTGGCC 
GCTTCCTGTC 
CTTTAGTCAT 
GAGTTCTCTC 
TGGCATGGCT 
AGGATTATAT 
GTGGAGAGCT 
TGAAGAGACT 
ATGTTATTCA 



CACCCTTTGC 
TTCGCCTTGG 
GAGATCACTT 
ATTGAAGGCT 
AGCCCTGGAT 
GAGAAAAACT 
CATCCTCTAT 
GGTTGGCAGA 
GGCCTGCACT 
TTTTAAGTTC 
TTGCCTTGCA 
CTTTTCTTCC 
AGTAGACTAC 
TTGAGAACTC 
TGATTCTATT 
ACCATGAAAT 
TACATTTTCT 
CAAATAGGTT 
GTATCGACTA 
ACCCAAGTCA 
GTGACCTTCT 
GGCTGAGCTG 
AGGATATGAC 
TAATGTGGAA 
TGAGAAGCTC 
CCAAGCAGAT 
GACCTCTCTT 
AGAGTAGAAT 
ACCTGCATGG 
CACTTAGTGC 
TCTTGGTATT 
TGAAAATGTT 
GTTGTCCTGA 
TGTGTATGTG 
CCTTCATGTT 
TGAAGGCAGA 
TTTCTGCCTA 
GTCTCGCTCT 
TGCAGCCTCT 
NNNNAGTGAT 
TGAGCCACCT 
TTATCAACCC 
CCTTTTCTCC 
GATAGGAATG 
CTATGGCCAC 
TGCTTGCCCC 
ACCACAACAC 
CTCATCCAAT 
TTCCTCTCCT 
TTCTAAGTAG 
AGCATGGTTT 
ACTTAATTTG 
CCTTTCTTCA 
TAGTTATTGT 
AGG {SEQ ID NO: 3) 
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DNA 

Position 

2584 ATAAACCGCTATCATTATTTATGCATCTAATCCTCTTGGGACCTGTTATCAGGTGAGCAA 
CTTTTAAATCTTTTCCTTACCCCCCTAACCCCACCCCAGACTTGGGCAGAGAAAGATGAA 
AGATTTACAAGATGGATACTATGGCTCTAATCAATTCTCTCATTTCCTCCCACTCTCGGC 
TTCCCTGTCTACCATTCAGAAAACTTACCTGAAATCTTAAATGCCACCATGATGAACATG 
TGGTATGTACTTGTGTTCCAAAACAATGAACGATGCTATTTGGGCTGTGTAAACTAGAAT 
[G,C] 

GGAACAACAAGACGTGATCACCCTGTGCATGAAGGCCATAGCTGCAGAGTGTGTAATTTT 
ATTTAAAAAAATTTTTTTTTCTGAGACAAGGTCTTGCTCTGCCTCCCAGGCTACAGTGCA 
GTGGTGCGATCATGGCTCACTGCAGCCTTGATCTCCTGGGATCAAGCGAACCTCCCACCT 
CAGCCTCCAAGTAGCTGGGACCAAAGGAATGTGTCACCATGCCTGGTTAATTAAAAAAAA 
ATTTTTATAGGCCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAG 
(SEQ ID NO: 20) 

2655 TTTCCTTACCCCCCTAACCCCACCCCAGACTTGGGCAGAGAAAGATGAAAGATTTACAAG 
ATGGATACTATGGCTCTAATCAATTCTCTCATTTCCTCCCACTCTCGGCTTCCCTGTCTA 
CCATTCAGAAAACTTACCTGAAATCTTAAATGCCACCATGATGAACATGTGGTATGTACT 
TGTGTTCCAAAACAATGAACGATGCTATTTGGGCTGTGTAAACTAGAATGGGAACAACAA 
GACGTGAT.CACCCTGTGCATGAAGGCCATAGCTGCAGAGTGTGTAATTTTATTTAAAAAA 
[A,T] 

TTTTTTTTTCTGAGACAAGGTCTTGCTCTGCCTCCCAGGCTACAGTGCAGTGGTGCGATC 
ATGGCTCACTGCAGCCTTGATCTCCTGGGATCAAGCGAACCTCCCACCTCAGCCTCCAAG 
TAGCTGGGACCAAAGGAATGTGTCACCATGCCTGGTTAATTAAAAAAAAATTTTTATAGG 
CCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATC 
ACCTGAGGTCAGGAGTTCAAGACCAGCTGGCCAACATGGTGAAACCCCTGTCTCTACTAA 
(SEQ ID NO: 21) 
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3693 TCTCTGTCGGTTTATCAGTTTCCTATTTATCTCTTTGTATATTTCTGCAATAAAGATACG 
AAGTTGGGAGGGGGCAAAGGAAGGCAGTTCATCTCTCTATGTGGATGCAGTAGCACAATT 
TAATAGTATCAAGTATTTCCATTCAGATTGCCTTGAAGTGGAAAGAATGCACTTAATCCT 
AGCGAGATAGGCACCTGTGTCAACAGTCTCATCTGGATGCTATGGGGTTTTCAAGGTAGA 
GAGATGTTGCAAAACTTATGAGTTCAGGAGTAAGGAATGGACCAAGTTTGTCTTGATTGC 
[G, A] 



AGAGAGGCAGACAACTGCAGTCAGCCGAGGAATATGGGTCAGAGTGTTGCAATGGGAAGA O p^, 

TACCTCATCATTAGACAACTAAAAAGTCTGTGAAACTAATTAAGGATGGAACTCACTCCT t^I O 

TTATAAAATTTCATATCTGTACACATGTATAATTTTTATTTGTCACTTATACCTCAATAA 
GGCCAAAAAAATTTTTTATCAATAAATTTTTAAGTGGGGAGGAATCGATTAGGCTCTATC 
AGAGAGAATATGGGATATCAATGGAAACAGTGGCCTGAAATTTGGAGTCTAGTCTTCCGC 
(SEQ ID NO: 22) 



3992 CGAGAGAGGCAGACAACTGCAGTCAGCCGAGGAATATGGGTCAGAGTGTTGCAATGGGAA 

GATACCTCATCATTAGACAACTAAAAAGTCTGTGAAACTAATTAAGGATGGAACTCACTC g 
CTTTATAAAATTTCATATCTGTACACATGTATAATTTTTATTTGTCACTTATACCTCAAT 
AAGGCCAAAAAAATTTTTTATCAATAAATTTTTAAGTGGGGAGGAATCGATTAGGCTCTA 
TCAGAGAGAATATGGGATATCAATGGAAACAGTGGCCTGAAATTTGGAGTCTAGTCTTCC 
[G,C] 

CCTGTCATTGACTGGTTGTGTGTTCTTGGTAAAATCTCTGAAGATGGCTTCACAGGAAGG 
CATATAGAGTTCCCTCATCTGTAAAGCAAATGGGTTAGTCTAAATCATGGGTCTCAAACT 
CAAACACTTGCAGGGACCAGGCAGGTATCATAAATGAATGAAGCAGGCCTAGTATAAGAA 
AAAACAGTAGCCTTGTGTGAGATGATAAATGGAAACAAAGTCTCAGAGAAATACTGAGGA 
GTAGTGAGTACCATGGTAATCTGAAATCTTCATGACCTGCCTGAAGGAGGTAGCCCCTCT 
(SEQ ID NO: 23) 

6285 TCTTTCGAGGGCCTAGTTATGTAGTTCATTCAGGTTTGAGTTGTCGTCTTTTAAGTACTT 
TTGTTGCTTTGATGGCTTCCTGTGTATATGAGATATTTTTTTTCCTCTGATCTGTCCCAA 
GACTTTTTGGCTGAGATATGGTTGTGAGCCCTTTCTTGAAAAAGCAGAATCTGGCCAGGC 
GCAGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAAGCTGAGGTGGGTGGATCACCTGA 
GGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAAACCCGTCTCTACTAAAAATAC 

AAAAAAAAAAAAACCTTAGCCGGACATGGTGGCACATGCCTGTAATCCCAGCTACTCAGG 
AGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAGAGGTTACAGTGAGCTGAGATCG 
CGCCAGTGCACTCCAGCCTGGGCGACAGAGCAAGACTCTGTCTCAAAAAAAAAAAAAAAA 
AGAAAGAAAGAAAAAGAAAAAGCAGAATCTAAAACTTTGGTTATGGAGCTGAATGCTTTG 
AGGGAGGAATGCTTTACCTCACGAATTTGAGGTAAGAAAACAGGGCCTTTGGAACCTTCA 
(SEQ ID NO: 24) 

7066 TTGTAGAAGGCATGATCCACCCTTTGACTTATGAGAAATGATCAGAACAGAAGAGAGAAA 
AAGACAAAAAGTAGTGCAGGCTGGCCATGGTGTCTCACACGTGTGATCCCAGCACTTTAG 
GATCCCAGCACTTTGGGTCAAGGCAGTAGGATTGCTTGAGCCCAGGAGTTTGAGACCAGT 
CTGGGCAACATGTCTAGATCTCCTCTCTACACAAATTAAAAATAGCTGGCATGGTGGCAT 
GCGCCTGTAGTCCTAGCTACTCAGAAGGCTGAGGTGGGAGGATCATTTGAGCCTAGGAGG 
[A,T] 

CAAAGCTGCAATGAATTATGATTGTGCCACTGCACTCCAGCCAGGGTGATGGAGTAAGAC 
CTTGTCTCAAAAATAAAATAAAGTAGCACAACCTCCCCAAGTTATTTTTTTCCCTCACTA 
CAACCTCCCTTCCCAGGACAGCTTAGTTAAGTTTGCATGATGCTTTACTTCTGCAGATGT 
TTGGAGGCCATGATTAAGTACCTCACACTGTGGAAGAAAGAGGAGCAGGAGGAGCCCTAT 
GTCAGCCTCACCCGAAAGAAGATGCTAATAGATGGCGAGGAGGTGCTGATAGAATGGGAG 
(SEQ ID NO: 25) 

14223 AAGGAAGTTTATTTTGTATATTTATATGATTATTAAAGTGTTACAGTATATGTTCATCAT 
GAGAAATTTAGAAAATAGAGAAATGTAGAGAAAAAGATTTCTAAAACTGATATAAGACTA 
TCACACACAAAAAAAGATATTTTGGTTCATTTTTTCAATTTTTTGTGCATCTATTTTGTT 
TTATTGTATATATTCAAGGTGTACAATGTGATGTTTCGATGTATGTACACATTGTGAAAT 
GATTACCACAACCAAACTAATTAACACATTCATCACCTCACATAGTTATCATTTTTGTAC 
[-,T,G] 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGGTAAAACTTAAGATCTACTCTCTTTAAAAATT 
TCAAGTACACAATACATTATTGTCAACTATAGTCATCATGTTGTACATTAGAGCTCTGAA 
ACTTATTTATCTTATAACTCTAAATTTGTAGCCTTTGATCAAAATCCTTCTATTTCCCTA 
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AATCCCCATCCCCTGGTAACCACCCATTCTACTCTGTTGCTAGGTGTTCAACTTTTTTAG 
ATTCCACATATAAGTAAGACAATGCAGTATTTTTCTTTATGTGTCTAGCTCATTTCACTT 
(SEQ ID NO: 26) 




16915 ATAGGCACACTCTAAGGAGAAAGTGCAGAGTAGAATTCCTTCAGGGCATAAGCCAAAATG 
ACTCTTTTTCTCAGGGACCTGCATGGGCCTCCAGCTTGTCTATTGGAATTGTTAAGTGAA 
GCCTCTCACTTAGTGCCTCATTAGCAGAGATTTCCTCCAACCCAGCTTTTCTGTGCTCTT 
GGTATTTTACTACTTGATGTGGACCTCAGAGAAGCTGAACTGTAATTGAAAATGTTTCCG 
ATGTGTGGAAGAAATGAAGACTGCTTTGTGTCTGCTGTTGTCCTGAGTATTTCATTAATG 
[-,G,T] 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTATGTGTGTAGGGAAGAAAGTAATA 
ATGGCTGAGACATCACCTTCATGTTGTTTGCGATTGGGATGGGTGACTAACACTCCAAGG 
TAGAGTGAAGGCAGAGGAGGGAAACAAGATCACATTAAATCATCATCAGTACTGGTTTCT 
GCCTACAGGAGTTTACTTTTTTTTTTTTTCCTTTTTTGAGATGGAGTCTCGCTCTGTTTT 
CTAGGCTGAAGTGCAGTGGTGTGATCTTGGCTCACTGCAGCCTCTGCCTCCTGGGTTCAA 
(SEQ ID NO: 27) 
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